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Abstract. We prove a variation norm Carleson theorem for Walsh-Fourier series of func- 
tions with values in a UMD Banach space. Our only hypothesis on the Banach space is 
that it has finite tile-type, a notion introduced by Hytonen and Lacey. Given q e [2, oo) 
we show that, if the space X has tile-type t for all z e (q, oo) then the /"-variation of the 
Walsh-Fourier sums of any function / e L p ([0, 1); X) belongs to LP , whenever q < r < oo 
and (r/(q - 1))' < p < oo. We also show that if this conclusion is true for a variant of the 
variational Carleson operator then the space X necessarily has tile-type T for all t > q. For 
intermediate spaces, i.e. spaces X = [Y,H]g which are complex interpolation spaces between 
some UMD space Y and a Hilbert space H, the tile-type is q = 2/6. We show that in this case 
the variation norm Carleson theorem remains true for all r > q in the larger range p > (2r/q)' . 



1. Introduction 

The celebrated theorem of Carleson on the pointwise convergence of Fourier series, 
[5], asserts that the partial sums of the Fourier series of an L7-function converge almost 
everywhere to the function for 1 < p < oo. The usual strategy to prove this result is to 
show that the Carleson operator, that is, the maximal partial sums of the Fourier series of a 
function, is bounded on the corresponding I/-space. Subtle refinements of the Carleson 
theorem [8] involve more refined notions than the ^°°-norm of the partial sums. One of 
these notions is the variation norm Carleson theorem, proved in [19], which in particular 
shows the pointwise convergence of the Fourier series of / without having to resort to a 
dense subset where this convergence can be easily exhibited. A weighted version of the 
corresponding result for Walsh-Fourier series has recently appeared in [10]. 

We point out that, in the scalar case, variational bounds for the Hilbert transform as well 
as for more general Calderon-Zygmund operators have been obtained for example in [3] 
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and [4]. In [14] the authors study more singular operators such as averages along lower 
dimensional sets and truncations of singular integrals with rough kernels. However, there 
is no complete characterization of the Calderon-Zygmund kernels, say, that give rise to 
operators that obey variational bounds. The question is subtle since a variational bound 
immediately implies the pointwise converge of the truncations of singular integrals and 
it is known that this convergence fails for some operators. In the vector-valued setup the 
information is even scarcer. For example it is natural to ask whether the Hilbert transform 
of a function with values in a UMD Banach space obeys variational bounds. That would 
be the vector-valued analog of the main result in [14] and, to the best of our knowledge, it 
is not currently known. 

The first results on Carleson's theorem for vector-valued functions appear in [24] and 
[23] with the additional hypothesis that the target Banach space X is a UMD space with an 
unconditional basis, or more generally, that it is a UMD lattice. The corresponding result 
for Walsh-Fourier series was proved for UMD lattices in [27]. Recently in [20], Parcet, Soria 
and Xu proved the "little Carleson theorem" in general UMD spaces, thus taking a step 
away from the lattice hypothesis. In this paper we take up the investigation initiated in [12] 
and continued in [11,13], concerning the vector-valued extensions of Carleson's theorem 
and related issues, where the lattice assumption is completely avoided. In particular, this 
paper is very much in the spirit of [12] since we study the variant of Carleson's theorem, 
due to Billard [2], for Walsh-Fourier series. 

In particular we prove a variation norm version of Carleson's theorem for Walsh-Fourier 
series of vector-valued functions in the spirit of [10, 19]. A necessary condition for this type 
of results is known to be that the target Banach space X has the UMD property: X-valued 
martingale differences converge unconditionally in the space. See for example [11,23,24]. 
Furthermore, a key hypothesis in this paper is the tile-type hypothesis, introduced in [12]. 
This is a hypothesis with a probabilistic flavor on the geometry of X, which is easily seen 
to be stronger than usual cotype, although we do not know at this moment if it is strictly 
stronger than the hypothesis of finite cotype together with the UMD property. 

The tile-type hypothesis has been successfully used in [12] where it is shown that finite- 
tile type implies that the partial sums of Walsh-Fourier series converge to the function 
almost everywhere. Another hypothesis of similar flavor has been used in [11] to show 
the corresponding result in the trigonometric case. In these two works, the prototypical 
examples of UMD Banach spaces with finite tile-type are the intermediate spaces X = [Y, H] g , 
that is, spaces X which are complex interpolation spaces between some UMD Banach space 
Y and a Hilbert space H. This class of Banach spaces includes for example all the UMD 
lattices but also the Schatten ideals C p , 1 < p < oo, and in general all examples of UMD 
spaces that are currently known, which are not necessarily function lattices. However the 
tile-type turns out to be a qualitative hypothesis for the Carleson theorem: the exact value 
of the tile-type is irrelevant, it is only needed that it is finite. This changes dramatically 
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when one looks at models for vector-valued bilinear Hilbert transforms, [13]. There the 
exact value of the tile-types of the Banach spaces involved play a crucial role in the range of 
inequalities one can prove, and in fact one needs to stay 'close enough' to the Hilbert space 
tile-type q = 2 to get any bounds on the Walsh model of the bilinear Hilbert transform 1 . 

The present paper, besides bringing another result in the vector-valued setup, has a 
different objective: to relate the tile-type of the Banach space to the boundedness of the 
Carleson operator in a quantitative manner. Although this is not possible in the usual 
formulation of the Carleson theorem, the variational variant gives the correct framework 
for such a quantification. Furthermore, in this setup, one can prove a partial converse. 
Roughly speaking we can show that if a variation norm version of the Carleson operator is 
bounded within a certain range of exponents then necessarily the target space X has finite 
tile-type whose exact value is directly related to the variation index. Thus the variation 
index, which can be viewed as a quantification of the rate of convergence of the partial 
Walsh-Fourier series, is intimately related to the tile-type of the UMD Banach space under 
consideration. The partial converse described above is a step towards characterizing the 
UMD Banach spaces for which Carleson's theorem is true. In particular we know that 
tile-type implies Carleson's theorem. However we now see that the tile type condition 
implies, and essentially characterizes, the validity of a variation norm Carleson theorem 
on a UMD Banach space. The question whether Carleson's theorem is valid on an arbitrary 
UMD Banach space remains open however. Our method and general strategy of proof is 
along the lines of [10,12,19], using the time-frequency analysis techniques and arguments 
introduced in [15]. 

The first thing we need to do in order to formulate our main results is to describe what 
is the variation norm of a sequence. This norm plays a central role throughout the paper. 

Let 1 < r < oo and x = {x n } n eN be a sequence of elements of a Banach space (X, | ■ |). The 
r-variation of the sequence x is defined as 



with the usual modification when r = oo. Observe that the "V^-norm is essentially the 
^°°-norm. If {/„} )i£ ]N is a sequence of X-valued functions, /„ : R+ — > X, we will also write 



|x||'V r (X) : - 



sup sup > \x N 



K N <-<N K 





There is another striking detail, that one can formulate vector-valued results that include non-UMD 
Banach spaces; see Silva [25]. 
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Now for / e I/([0, 1); X) for some 1 < p < oo consider the partial Walsh sums 



N-l 



S N f(x) := ^(f,w n )w n (x), 



n=0 

where w n is the n-th Walsh function on [0, 1). Precise definitions will be given in Section 3. 

The variational norm version of Carleson's theorem for vector-valued Walsh series is: 

1.1. Theorem. Given q e [2, oo), let X be a Banach space which has tile-type t for all x > q. 
Suppose that 

1 1 o-2 

q <r < oo, 0<-< 1 . 

p r' r 

Then for every f e LP([0, 1); X), 

I|Sn/IIlp([0,1);'V(X)) ^ II/IIlp([0,1);X)/ 

with the implicit constant depending only upon p, r, q and the space X. 

Observe that for q = 2 the restriction on the integrability exponent becomes p > r' which 
is necessary in the scalar case of the Fourier analog of Theorem 1.1, [19]. Here we get a 
condition that becomes more stringent as X 'moves away' from the Hilbert space case, 
quantified by the tile-type. 

In the special case that X is an intermediate space, that is X = [Y,H] is a complex 
interpolation space between some UMD Banach space Y and a Hilbert space H, and 
< 8 < 1, it was shown in [12] that X has tile-type q = 2/6. Thus Theorem 1.1 immediately 
applies to all intermediate spaces of this type. However, arguing directly by interpolation 
we get a slightly stronger theorem: 

1.2. Theorem. Suppose that X := [Y,H] e , < < 1, is a complex interpolation space between a 
UMD Banach space Y and a Hilbert space X. Set q :=2/d. Suppose that 

n 1 1 1~ 2 
q <r < oo, 0<-< ■ . 

' p r' 2r 

Then for every f e L p ([0, 1); X) we have that 

\\SNf\\u'(ioA)fV r (M ~ II/IIlp([o,i);X)- 
where the implicit constant depends only on p, r, q and the space X. 

So far the only Banach spaces which are known to have finite tile-type are exactly the 
complex interpolation spaces of Theorem 1.2. From this point of view, the weaker Theorem 
1.1 seems uninteresting compared to Theorem 1.2. However, the formulation of Theorem 
1.1 only assumes finite tile-type. This provides an indication that the finite tile-type of 
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a Banach space X could be a strictly weaker hypothesis than that of X being a complex 
interpolation space. Furthermore, Theorem 1.1 has a partial converse. With details to be 
explained later on in the paper we show that if the conclusion of Theorem 1 .1 is true for the 
operator C )P , a linearized version of the variational Carleson operator, then X necessarily 
has tile-type % for all t > q. Furthermore it is easy to see that tile-type q implies usual 
cotype q so cotype t for all % > q is a necessary condition for the boundedness of C t/ p in the 
range of Theorem 1.1. 

The Fourier scalar variational Carleson theorem is related to a number of topics, includ- 
ing variational estimates for singular integrals [14]; refined estimates in ergodic theory 
[7,8]; maximal inequalities [18], and approaches to extensions of results of Christ-Kiselev 
[6] in spectral theory, see [19, Appendix C]. Some of these continue to be under active 
development [9, 17]. At some point, these topics might be ripe for investigation in the 
vector valued case. 

The rest of the paper is organized as follows. In section 3 we introduce tiles and 
trees in the time-frequency plane and define the corresponding wave packets in terms of 
appropriate Walsh functions. In section 4 we review the definition of the tile-type of a 
Banach space, adjusted to the needs of the present paper. This definition is slightly weaker 
than the one in [12] but philosophically it is the same. We also discuss several structural 
properties of the trees that one needs to consider in the definition of the tile-type and show 
that tile-type q implies martingale cotype q for any Banach space X. We also recall the 
vector-valued version of Lepingle's inequality which will play an important role later on 
in the proof. In section 5 we reduce the original Theorem 1.1 to a statement about a variant 
of the variational Carleson operator. For this variant we prove that the tile-type hypothesis 
is sharp. We eventually linearize all our operators and reduce the main theorem to the 
statement of Proposition 5.3. In Sections 6 to 8 we introduce all the necessary machinery 
from time-frequency analysis and prove Proposition 5.3. The last section 9 is devoted to 
the proof of Theorem 1.2. The proof uses complex interpolation between the full range 
of r-variation inequalities, valid in any Hilbert space, and the co-variation bounds for 
intermediate spaces from [12]. 



2. Notation 

Throughout the text c, C will denote generic positive constants that might change even 
in the same line of text. We write A < B if A < cB for some numerical constant c > 0. 
In this paper the implicit constants that appear in various estimates may depend on the 
variation index r, the integrability index p, the tile-type index q and the space X itself, but 
we typically suppress this dependence since it is of no importance. We denote by N the 
set of non-negative integers and by IR+ the set of non-negative real numbers. Finally the 
dyadic intervals on the positive real line are denoted by T). These are the intervals of the 
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form [n2 k , (n + l)2 k ) where neN and k e Z. For any interval co with endpoints a < b we 
use the standard notation co := (a, b) for its interior as well as the notations [co) := [a, b) 
and (co] := (a, b]. If no special notation is used by convention we use co = [co) = [a, b). We 
denote by E* the conditional expectation with respect to dyadic intervals of length 2 k : 



leO,\l\=2 k 

the dyadic maximal function is then Mf(x) := sup k \Ekf(x)\. Finally we denote by BMO the 
dyadic BMO space on the positive real line, equipped with the norm 



BMO(R+;X) - 11/llBMO 



:=SU P^ \\fW ~ (f)i\dx, 

IeD l J l J I 



where </>/ := jjf Jf /• Note that, throughout the text, the notation | • | is used both for the 
absolute value as well as for the norm of the Banach space X, depending on context. 

3. Walsh wave packets, Tiles and Trees 

A tile P is a dyadic rectangle of area 1 in the time-frequency plane R+ x R+, namely 

1 

P = I P x cop = I P x — [n, n + 1), neN, I P e T), 
\Ip\ 

If P,P' are tiles we write P < P' if Ip c and cup c cup. Likewise, a bzffZe P is a dyadic 
rectangle of area 2: 

2 1 1 

P = IpXcop = I P x —[n,n + 1) = I J J P x 7—.[2n + v,2n + v + 1) =: P ri UP„. 

|ip| ^ lipl 

Thus each bitile P has a 'down-part' and an 'up-part' which are also dyadic. The partial 
order is also used for bitiles. Furthermore we write 

P < u P' ^ P u < P' u and P < d P' d 4 P d < P' d . 

A tree T is a collection of bitiles P for which there exist a top bitile T, which is not necessarily 
part of the collection, such that P < T f or all P e T. Note that in general a top of a tree is 
not uniquely defined. Similarly we say that T is an up-tree if P < U T for all P e T and some 
top T and a down-tree if P <d T for all P £ T. Trivially any tree can be decomposed into an 
up-tree and a down-tree: 

T = T„UT d/ T a = {PGT:P<„T}, t/= {P e T : P < Jj. 

The Rademacher functions are defined as 

n{x) := sgnsin(27i • 2 l x) = ^ (l 2 -i[jyc+i/2)(*) - h-'[k+i/2,k+i)( x ))- 
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If n e IN has binary expansion 

oo 

n = Y j n i 2 i l n ; e{0,l}, 

(=0 

we define the n-th Walsh function to be 

oo 

w„(x) := J^r,(x) ni . 

(=0 

Observe that we recover the Rademacher functions from the Walsh functions by means of 
w 2 i(x) = Ti(x). It is well known and easy to see that the restrictions {t0 n l[o,i)}ne)N form an 
orthonormal basis of L 2 (0, 1). 

With these definitions at hand we now associate to each tile P c 1R+ x R+ a wave packet 
Wp as follows. First we write the tile P with respect to its time and frequency components: 

1 

P = I P x co P = Ip x — [n, n + 1), Ip £ D, n e N. 
\m 

The wave packet w P is now defined as 

w P {x) := —^l Ip {x)w n (^-\ =: — T Wp (x). 
\I P \i V M ; \Ip\l 

Observe that Wp is L 2 -normalized while wp° is L°°-normalized, hence the superscript oo. 
The Haar functions are special cases of wave packets corresponding to the tiles of the form 
P = Jx|J|- 1 [l,2): 

Given a bitile P = P d UP u we use the notations zt>p u and w Pd for the wave packets associated 
with the up-part and the down-part of the bitile P, respectively. 

If a collection of bitiles arises from a single up-tree then the following lemma gives a 
very useful description of the wave packets in terms of the simpler Haar functions. This 
lemma is taken from [12, Lemma 2.2] where we also refer the reader for the proof. 

3.1. Lemma. Let T be an up-tree with top T. For all P £ T we have 

w Pd {x) = e PT ■ Wj u (x) ■ h Ip (x), 

whereWj is the L°° -normalized wave packet associated to the up-part of the top T and e PT € {— 1,+1} 
zs a constant factor that depends only on P and T. In particular we have that 

(f,w Pd )wp d = (f ■ Wj u ,h lp )h lp ■ Wj u . 

If 'T is a down-tree with top T we have a symmetric statement: 

W P U ( X ) = €PT m (X) ■ hj p (x). 
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4. Tile-type and cotype of a Banach space 



The notion of the tile-type of a Banach space X was introduced in [12] to show that if a 
Banach space X has finite tile-type then Carleson's theorem for Walsh-Fourier series is true 
for X-valued functions. More recently it was shown in [11] that a variant of the tile-type, 
adapted to the Fourier wave packets, also implies Carleson's theorem for the trigonometric 
system. Here we give a definition which is similar in spirit to that in [12, §3]. The main 
difference is that we only consider very special collections of trees in the definition of 
tile-type. These are essentially the trees generated by the selection algorithm in the size 
lemma, Lemma 7.2, and that lemma is the only place in the proof where the tile-type 
hypothesis is needed. In fact the reader is encouraged to briefly go through the statement 
and proof of the size lemma in order to gain some intuition on the definition that follows. 
The reason for giving this weaker but more complicated definition of tile-type is that it 
allows us to prove a partial converse of the variational Carleson theorem in Proposition 
5.2, namely that the r-variational boundedness of a Carleson-type operator implies that 
the space X necessarily has tile-type % for all % > r. 



4.1. Good collections of trees and tile-type of a Banach space. We now describe the 
collections of trees that we want to consider in the definition of the tile-type. Let 7~ = {Ty}y 
be a finite collection of up-trees, each consisting of finitely many bitiles, and set 

P := {P : P G Ty for some ;'} = UyTy. 

Denote by c(I) the center of some dyadic interval I e D. We will call the collection T 
u(p)-good if it has the following property: 

There is a reordering of the trees {Ty} ; and a choice of corresponding tops {T ; } ; such that 
{c(co T )}j is an increasing sequence and 



There is a symmetric definition of a good collection of down-trees, namely a collection T of 
down-trees will be called d(oivn)-good if there is a reordering of the trees {Ty}y and a choice 
of corresponding tops {T ; } ; such that {c(oj Tj )}j is a decreasing sequence and 



Ty = {P e P : P < u Tj and P £ T k for all k < /}. 



Ty = (P e P : P < d Tj and P £T k for all k < ;}. 



We say that a Banach space X has u-tile-type q if the estimate 




TeT PeT 
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holds uniformly for all u-good collections T. Similarly, we say that a Banach space X has 
d-tile type q if 



TeT PeT 



L'J(R + ;X) 



1 

< 



L1(R + ;X)/ 



uniformly, for all rf-good collections T . It is actually not hard to see that the two definitions 
of tile-type, namely the one given with respect to u-good collections and the one given 
with respect to d-good collections, are equivalent. This is the content of the following 
lemma. 

4.1. Lemma. A Banach space X has u-tile type q if and only if it has d-tile type q. 

Proof. Let us assume that X has u-tile type q and let T be a d-good collection of down-trees. 
Let us fix a choice of tops {T ; } ; ordered so that the sequence of centers {c(co Tj )}j is decreasing 
and 

Tj ■ = {P e P : P < d Tj and P £T k for all k < /}. 

Let P be the collection of all tiles in T and define 2 N := sup PeP sup cop. For any bitile P e P, 
P = Ip x |/p| _1 [n, n + 2), we define the transformation: 

P h» P := I P x [2 N - (n + 2)\I P \-\2 N - nlJpp 1 ). 

By the choice of N the transformation above maps bitilesP c IR+xR+intobitilesP c 1R+XR+ 
so that 

Pd = K P u = Pd. 

That is, the down-part of a bitile P transforms into the up-part of the bitile P and vice versa. 
Transforming the tops {Tj}j accordingly we obtain a collection T, consisting of up-trees, 
and a sequence of tops {T ; }y which together form a u-good collection. By the assumption 
that X has u-tile type q we thus get 

i 

TeT PeT 

Once this estimate is written down, the specific choice of tops {T ; } ; is not relevant any 
more. For each tree T, it is clear that we can choose a top Sj with |7 S .| = max ; |7 X .|. Thus 
the number |7g.| does not depend on j. Applying Lemma 3.1 to the tiles P belonging to the 
up-tree T with top S, we have 

wp d (x) = e PS ■ h Is (x) ■ wf(x) and w Pu (x) = e PS ■ h Is (x) ■ w^x). 

Define 2 m := 2 N |7 S | and note that this number does not depend on the specific choice of tree 
T. In order to derive a relation between w"? and let us write S = 1$ x |7 s | _1 [n, n + 2) so 
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that S d = I s x \I s \- l [n,n + 1) and S u = I s x |J s r 1 [2 m - n - l,2 m - n). We have 

Wf(x) = l Is (x)W 2 m-n-l(x/\Is\) = li s {x)w 1 m_ l {xl\l s \)lO n {xl\l s \) 

•J 11 

= w 2 m-i(x/\I s \) ■ Wg d (x) =: (/) P (x) • w™ d {x), 
where (pp is a unimodular function that depends only on the collection P. Thus 



L<!(R+;X) 



> 



L 

TeT PeT 



M(R+;X) 



wp«)wp„<t>r 



TeT PeT 



L'i(R+;X)j 



Replacing / by f§ v l in the previous estimate we conclude that X has d-tile type q. The 
proof of the reverse implication is completely symmetric. □ 

4.2. Remark. In view of Lemma 4.1 we will henceforth say that a Banach space X has tile 
type q whenever it has u-tile type or rf-tile-type q. We will also talk about good collections 
T without specifying whether we are talking about u-good or rf-good collections. Further- 
more, it is obvious that if a collection of trees can be split into a finite number k of good 
collections then the tile-type inequality still holds for the original collection with some 
different constant depending on k. We will then say that 7~ is a fc-good collection, or just a 
good collection if it is clear that the number k does not depend on anything interesting. 



The following lemma gathers some useful properties of good collections T and is the 
main ingredient in the proof of the partial converse in Proposition 5.2. 

4.3. Lemma. Let T = |T ; }^ 1 be a good collection ofup-trees and denote by P the set of all Utiles 
in T . Let {Ty}^ be the collection of the corresponding tops from the definition a good collection, 
ordered so that {c(co Tj )}j is increasing. We have the following properties: 

(i) The down-parts of the Utiles in T are disjoint: 
if P,P' G P and P ± P' then P' d nP d = 0. 



(ii) Let k(j,x) := max^Jl < k < j : I Tk 3 x} with the understanding that max0 := 0. For 
j G {1,2, ...,M} define the measurable functions Nj : 1R+ — > 1R+ as Nj(x) := c{ooT k{jx) ). Let us 
also set No(x) = for convenience. For any fixed x G R+ the sequence {Nj(x)}j is increasing. 
Furthermore, for each j and x G R+ we have 

{P G Tj : I P 3x} = {PeF: I P 3 x, Nj(x) G [co Pu ) and N H (x) i& P }. 



(Hi) For 1 < r < oo we have 



£j£//,w Pi > P J = ( J^\J^(f,w Pd )wp d (x)l {Nj 

j PeT, + ' J j PeP 



(x)e[ai Pu ), N H (x)(d) P ) 



dx. 
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Proof. For (i) observe that if two bitiles P,F e P belong to the same tree T G T then we 
always have Pd D P' d = since T is an up-tree. Suppose now that P G Ty, P' € Tjt where 
Ty, are two different trees in T . Assume for the sake of contradiction that Pd n P' d ± so 
that Ip n Ip/ ^ and cu Prf n a) P > d + 0. Then we have for example that cop d c <y Prf . However, 
since P, P' are different tiles we must actually have that cop c o^. Thus 

cut. c cup c cup' c co P > and Ip n Ip> c I T n I P < 
which implies that P' < Ty. We then get the following inequality for the centers of co Tj , co Tk : 

c(a) Tj ) < suptyj. < sup cop < sup co P > d = inf a)p' u < mfco Tku = c(co Tk ). 

By the definition of a good collection we thus have that ; < k so that P' ^ Ty, a contradiction. 

We now prove (ii). First note that for every fixed x G 1R+ the sequence {Ny(x)}y is increasing 
as a composition of increasing functions of /. In order to prove the main claim in (ii) we 
fix x G 1R+ and ; e {1 . . . ,M} and define the collections of bitiles 

S(j,x) := {P e Ty : I P 3 x), B(j,x) := {P e P : I P 3 x,Nj(x) e co Pu , N H (x) £ d) P ), 

where Nj(x) is as in the statement of the lemma. We claim that S(j, x) = B(j, x). If x ^ I T then 
both collections are empty: for S(j, x) this is because x i lj j □ Ip while for B(;, x) because 
Nj-i(x) = Nj(x) in this case. 

It remains to verify the claim when x £ lj j in which case Nj(x) = c{cot^) and Ny_i(x) e 
{0,c(o;r, ._ u) )}. 

Let P G S(/, x). Then x e I P c J T . and by the definition of the good collection we have that 
P <u Ty and P £T k for any k< The condition P < M Ty implies that Ny(x) = c(cu T/ ) e cup u . 
If x ^ ^£<j-ih e then Ny_i(x) = which is never in the interior of any frequency interval thus 
Ny-i(x) i (bp in this case. On the other hand if x e ^i<j-\h t we have that x e Ir,t _ lx) and 
P ^ Tk(j-i, x ) since - l,x) < /. Since x e I P C\ h HhU) ^ w e must have coj Kj _ U) a) P and 
thus Ny_i(x) = c(cuj )c(y _ li . ) ) g cup. This proves the inclusion S(/,x) c B(j,x). 

For the opposite inclusion assume that P £ B(j,x). Since x e It we have that Ny(x) = 
£(0;^.) G cup M which is equivalent to cur /M c oj Pu . Since x G I P n J x . this shows that P <„ Ty. 
Now it is not hard to see that P iLT e whenever t < ). Indeed suppose that we had P <T e 
for some t < j. This would imply that x G I P c I Tf and thus £ < - 1, x) < j. Furthermore 
we would have coj l ,coj i c cup so by the convexity of the interval cup and the fact that 
the sequence {c(cojj)}j is increasing we would conclude that Ny_i(x) = c(o;t ) , ._ U) ) g cup, 
contradicting the second condition in the definition of B(;,x). This proves the inclusion 
B(j,x) c S(j,x) and thus concludes the proof of (ii). 

Finally part (iii) of the lemma is an obvious application of the identity S(j, x) = B(j, x). □ 
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In the following paragraphs of this section we will investigate how the tile-type condition 
relates to the classical cotype of a Banach space. For this we will need to be able to view 
the dyadic intervals inside [0, 1) as the time intervals of bitiles of a suitable good collection. 
This is the content of the following lemma. 

4.4. Lemma. Let J be a finite collection of dyadic intervals in [0, 1) and define 2~ N := min 7e j- 

(i) LetT := {Ix[2 N+1 -2\I\~\2 N+1 ) : IeJ}. ThenT is an up-tree and J = {I P : P = I P xco P e T}. 

(ii) There exists a good collection ofup-trees T = {T ; }^ such that T = U ; T ; and 

{IeJ:\I\ = 2>~ N } = {I P :P = IpX(o P £ T ; }, < ; < N. 

(Hi) For 1 <r < oo and every f e L r ([0, 1); X) we have the identity 

N N 

Yj\Yj( w jf /Wp ^ wp ^\ r = Yj\ Yj <f> h i) h i( x )\ r > 

7=0 PeTj j=0 IeJ 

\I\=2-i 

where Wj is a unimodular function that depends only on the collection Sf. 

Proof. For (i) it is enough to notice that the tile T := [0, 1) x [2 N+1 - 2,2 N+1 ) satisfies P < u T 
for all P e T. Let us now show (ii). For IeJ' we set coi := [2 N+1 - 2|J|- 1 ,2 N+1 ) and for all 
< j <N we define the trees 

j r - {P = Ixcv I : IeJ, |J| = 2^'- N }. 
We define an appropriate top Tj for each tree Ty by setting 

Tj := [0,2) x [2 N+1 - 2 N ^,2 N+1 - 2 N ~i + 1). 
Suppose that P = I P x cop e T ; for some j e {0, 1, ... , N}. Then 

co Tj = [2 N+1 - UpI" 1 ^^ 1 - \Ip\~ 1 + 1) c cv Pu 

and obviously we always have that I P c I Tk . Thus each Tj is a top of T ; and hence each T ; is 
an up-tree. By construction the sequence {c(a>r ; )}/<N is strictly increasing and furthermore 
the intervals coj j are disjoint. We first show that the collection {T ; } ; < N satisfies 

Tj = {P e P : P < u Tj and P £ u T k for all k < j}, 

where P is the collection of all the bitiles contained in the trees Ty. Let P e Tj. We already 
saw that P < u Tj. Furthermore for k < j we have that sup co Tk ^ inf co Tj < inf co Pu which 
implies that P £ u Tk whenever k < j. This proves 

Tj c {P e P : P < u Tj and P £ u T k for all k < j}. 

Now assume that P e {P e P : P < u Tj and P £ u T k for all k < j}. Then we have P < u Tj 
thus co Tju c o) Pu which implies that 2 N+1 - IJpl" 1 < 2 N+1 - 2 N ~i <^ |7 P | < 2'~ N . We claim that 
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in fact |7 P | = 2^ N . Indeed, if \I P \ < 2 ( ^ 1)_N then we would get that co Tj _ u Q cop u and this in 
turn would give that P < u T,_i which is a contradiction. Since P 6 P and \Ip\ = 2>~ N we get 
that P 6 T;. 

Observe that if P € T ; for some j then by construction P n T\ = for all k < j — 1. Thus 
we have P £ u P for k < j -1. We now split the collection 7~ into two collections 
by setting say T\ = {T 2; } ; and 7~2 = {T^+i}; and each collection T v , v = 1,2, is good. This 
shows that the original collection 7~ is a 2-good collection. 

For (iii), remember that the trees Ty share a common top T = [0,1) x [2 N+1 - 2, 2 N+1 ). 
Thus, Lemma 3.1 implies that 

|^</,wp rf >wp rf (x)| = \^{fw™ ,hi P )WrK\ = \Yj(f w T' h ip) h i P \' 

PeTj PeT, IeJ 

which proves the claim in (iii) by setting Wj := Wj and replacing / by fvfl. □ 

Finally we recall the main result proved in [12] concerning the tile-type of an interpola- 
tion space X. Observe that by Lemma 4.3 the down-parts of all bitiles in a good collection 
T are disjoint; thus the following Proposition is identical to [12, Proposition 3.1]. 

4.5. Proposition. A necessary condition for tile-type q is that X is a UMD space and q > 2. If a 
UMD space has tile-type q, it has tile-type pfor all p e [q, 00). Every Hilbert space has tile-type 2, 
and every complex interpolation space [Y,H] e , e (0, 1), between a UMD space Y and a Hilbert 
space H has tile-type 2/6. 

4.2. Tile-type implies cotype. We observe in this paragraph that the hypothesis that a 
Banach space X has tile-type q implies that the space X has Rademacher and martingale 
cotype equal to q. We recall the relevant definitions. 

Let 2 < q < 00. We say that a Banach space X has (Rademacher) cotype q if 

(Ewf s|IL^II„ (l „,,„ 

;>0 j>0 

holds uniformly for all finite sequences {x ; } ; c X, where \rj\j are the Rademacher functions 
on [0,1). 

On the other hand, we say that X has martingale cotype q £ [2, 00] (or M-cotype q) if for 
all X-valued martingales {M n }„ we have 

(E^|M B -M M _i|«)^(supE|MJ^. 
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Every Banach space trivially has cotype and M-cotype oo. In general the notion of M- 
cotype is stronger than that of usual cotype but the two notions are equivalent in the case 
that X has the UMD property Finally we note that martingale cotype is equivalent to 
Haar cotype, meaning that it is suffices to consider Haar martingales in the definition of 
M-cotype. See [21]. 

The following proposition shows that tile-type implies M-cotype with the same index : 

4.6. Proposition. Suppose that the Banach space X has tile-type q > 2. Then X is UMD and has 
M-cotype q. 



Proof. The fact that tile-type q implies the UMD property is already contained in Proposition 
4.5 but we include a proof here for the sake of completeness. It will suffice to show that 



I/([0,1);X)- 



where Sf is any finite collection of dyadic intervals inside [0, 1), / £ U([0, 1); X), ej e {-1, +1} 
and r is some fixed exponent in (1, oo). Because of the following trivial estimate 



ei=-\ 



e,=l 



e,=-l 



it will actually suffice to prove that 



i/([0,i);X)- 



whenever ej e {0, 1}. However this amounts to showing that 



([0,D;X) 



< 



L'([0,1);X), 



lej' 



for any finite collection J"' of dyadic intervals in [0, 1). Consider the up-tree given by (i) of 
Lemma 4.4 applied to the collection J"': 

T := {I X [2 N+1 - 2|I|- 1 ,2 N+1 ) : I e J'}, 

where N is such that \I\ > 2~ N for all J 6 J'. By the tile-type hypothesis for the collection 
consisting of the single tree T and Lemma 3.1 we now get the UMD condition for Haar 
martingales with r = q. 

We will now show that X has martingale cotype q. By Lemma 4.4 we have for every 
positive integer N that 

N N 
;=0 \i\=2-k j=0 PeTj 
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where {Ty}^ is a good collection of up-trees. Since X has tile-type q the right hand side is 
controlled by ||/|b([o,i);X)- Thus 

l '([0,i);X)' 

0<k<N \I\=2~ k 

with the implicit constant not depending on N. This is the cotype condition for Haar 
martingale differences which by [21] is equivalent to X having martingale cotype q. □ 

4.3. Vector-valued Lepingle inequality. The variational Carleson theorem, in the scalar 
case, depends on certain jump inequalities originally due to Lepingle [16]. This fact has 
been recorded and well understood in several papers as for example in [10], [19], [14]. For 
the Banach space case that we are considering we will need the appropriate vector-valued 
extension proved by Pisier and Xu in [22]: 

4.7. Theorem ([22, Theorem 4.3]). Suppose that X has cotype t for all t > q. Then we have 
Lepingle's inequality for functions f £ L^(IR + ;X): 

I|E„/||lp(R +; V(X)) ^ ll/lb'(R + ;X)/ 

for allr > q and 1 < p < oo. 

By Proposition 4.6 one can replace the cotype t > q hypothesis in Theorem 4.7 by the 
hypothesis that X has tile-type x for all t > q. We will use this fact in what follows without 
further comment. 



5. Linearization of the Variational Carleson operator 

In this section we linearize the variational norm of the partial Walsh-Fourier sums of 
a function /, using more or less standard arguments as in [26], [12], [10] and [19]. We 
reduce the statement of Theorem 1.1 to an analogous statement about some closely related 
linearized versions of the variational Carleson operator which are more amenable to the 
time-frequency analysis techniques and interpolation. We carry the tile-type hypothesis 
throughout the section in the statements of our reduced theorems. 

For any collection of bitiles P we define the operator 
Q. rf(x) := sup Y | V </, w Pd )w Pd (x)l {N . e 

K,N <-<N K V y-f ' 

where the supremum is taken over all positive integers K and all non-negative real numbers 
N < N\ < ■ ■ ■ < N K - There is a symmetric version, denoted by Qp/(x), in which the 
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down-tiles are replaced by up-tiles and the condition in the indicator is replaced by Nj g 

d)p , Nj-i e (co Pd ], namely: 



These operators are formed over all bitiles, namely, one only requires Ip c [0, oo). 

Theorem 1.1 will be a consequence of the following theorem for C, p and its symmetric 
analog for C r y. 

5.1. Theorem. Let Xbe a Banach space with tile-type t for all z > q and P be any collection of 
bitiles. We have 

IIQp/IIlp(R + ;X) ^p,r,q II/IIlp(R+;X)/ 

whenever q < r < oo and < - < ^ - — . 

' p r r 

Concerning the proof of this Theorem, we focus on the operator C,. /P /(x), using in partic- 
ular the partial order on bitiles and their organization into trees, among other techniques. 
The main hypothesis is that the space X has finite tile type arbitrarily close to some number 
q. The reader here should prefer to think of the tile type hypothesis in the formulation 
given for families of up-trees, that is, in the equivalent formulation of the u-tile type. For 
the operator, C n pf, the proof is completely symmetric, in view of Lemma 4.1, where the 
role of the down tile Pd is analogous to that of the up-tile, and vice-versa. For all the con- 
siderations concerning the operator C r ,p/ we switch our point of view to the formulation 
of the rf-tile type. Bearing this in mind it is routine to adjust the arguments in this paper, 
given for the operator C r p/, in order to give the corresponding proof for the symmetric 
operator C r ,p/. We thus omit any further discussion concerning the proof of Theorem 5.1 
for the dual operator C r ,p/. 

We briefly describe how to conclude Theorem 1.1 from Theorem 5.1: 

Proof of Theorem 1.1. For integers < C < £'/ let Q^' be the maximal dyadic intervals 
oj c [C, C')- These intervals partition [C, C), and moreover we have 



This follows from [26, Corollary 8.3] and is a variant of the the formula [26, p. 68-69]. Now, 
let be those intervals co e Q c ^ for which co is the up/down-half of its parent. Let P"' c , 




Sc> f-S c f= J^j (f> w p) w p ■ 



P is a tile 
IpC[0,l), a> P eQ^/ 
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be the collection of Utiles such that cop, G Q"^, and I P c [0, 1). We then have 



Svf-S z f= Yj Yj(f' Wp ^ Wp ° 



ae\u,d) PeP° c , 



We have PgP^, if and only if I P c [0, 1), C i &p, and C G [cup,,), conditions in agreement 
with the conditions on N,-_i,Ny in the definition of C n p/. In the symmetric case, we 
have P G P" c , if and only if I P c [0,1), ( e (<vp d ], and C £ c^p- All together, for any K, 
Nq < ■ ■ ■ < Nk, we have 



/=1 a£{M}PeP^._ iiN . 



~ E E| E (f' Wp ^ Wp ° 
ce[u,d} ;=1 PeP^ ^ 

for some fixed collections of bitiles Pi and P2. Using Theorem 5.1, which is valid for 
arbitrary collections P, and the pointwise inequality just proved, completes the proof. □ 

As promised in the introduction, the variational formulation of the Walsh-Carleson 
theorem can give some intuition on the role of the tile-type hypothesis in a quantitative 
manner. One expression of this heuristic is that Theorem 5.1 is sharp as far as the tile-type 
hypothesis is concerned. 

5.2. Proposition. Let X be some Banach space and suppose that for any collection of bitiles P, the 
operator Qp, or the operator C rP , satisfies the conclusion of Theorem 5.1 with p = r: 

IIQp/lll/(R + ;X) ^ II/IIl'(R + ;X), 

whenever q < r < 00. Then X has tile-type x for all t > q and, a fortiori, X has cotype t for all 
t > q. 



Proof. We will prove the proposition assuming that the operator C rF is bounded on 
I/(IR + ;X) for all r > q. Let T = {T,} be a good collection of trees. By Lemma 4.3, (ii), 
there is an increasing sequence of integer valued functions {N,(x)} ; , such that 



i P^i 



L(R + ;X) 

^ HQp/Hl>(R + ;X) £r II/IIl»(r +; x)' 



L r (R+;X) 
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since C (/ p is bounded on I/(IR + ; X). If the hypothesis is true for C,,p we consider d-good col- 
lections of trees and show the corresponding statement for the rf-tile type. The conclusion 
then follows by using the analogue of Lemma 4.3 for rf-good collections. □ 



Following [10] we consider the linearized version of C r p given by 

m 

C P /(x) = C rAP /(x) := £ Yjj, 

Wp d )Wp d (x)l[N j (x)e[a> Pu ), Ny_i (x)id) P }^j (x), 

;=1 PeP 

where K, N%, . . . Nk : 1R+ — > N are arbitrary measurable functions and a = {aj}j is a sequence 
of X* -valued functions with Ey=f K'( x )l'' = 1- The expression for the operator C P can be 
simplified by writing 

K(x) 

PeP ;'=1 PeP 

where 

K(x) 

a P (x) := ^ l[Nj(xMa> Pu ), N hl {x)t&p} a j( x )- 

The operator C P depends on both r and the choice of the sequence a but we suppress this 
fact in what follows in order to simplify our notation. 

Note here that our assumption that X has tile-type x e (q, oo) can be replaced by the 
assumption that X has tile-type exactly q. This is because all our conclusions are given in 
terms of open intervals with respect to p, r and q. By restricted weak type interpolation and 
duality, the proof of Theorem 5.1 thus reduces to the proof of the following statement: 

5.3. Proposition. Suppose that X is a Banach space with tile-type q > 2. Let F,E c R+ be 
measurable sets with \F\, \E\ < +oo. Then there is a major subset E' QE with \E'\ > j\E\ such that, 
for all f : X — > R+ with \ f\ < 1 F , and all g : X* — > R+ with \g\ < 1 E ', we have 

\(C P f,g)\<\F\hE\7, 
whenever < - < £ - — . 

p r r 

5.4. Remark. Observe that 

\(C P f,g)\ < Yj(fr^p d )(w Pd a P ,g)\ = Y_ i e P (f,Wp d )(w Pd a P ,g) = \(C$f,g)\ 

PeP PeP 

for some choice of signs e P G {-1, +1}, where 
Cp/(x) := £p(f,w Pd )w Pd (x) a P (x). 



PeP 
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We will thus prove the estimate in Proposition 5.3 for the larger operator Cp", which we 
immediately rename again to Cp, and any finite collection of bitiles P. 



6. The tree lemma 

Let P be a finite collection of bitiles. The density of the collection P is 

density(P) := sup sup (— \ \g(x)\ r ' Y \aj(x)\ r 'dx) . 

Per P'>p\\h'\J Ip , j:N ^f ec0pl > 

We define the size of a collection P to be 

size(P) := sup (— \\Y\f,w Pi )wp i (x)^dx\ . 

TCP up-tree M^Tl J p-?* / 

6.1. Lemma (Tree lemma). For every tree T we have 

WgC T f\\ u<M+) = ||^<//Wp rf )wp rf fl P g|| LS(R+) < size(T) density (T)|I T | % 

PeT 

for all I < s < r'. 

We will prove the lemma for the case s = r' which, by Holder's inequality, implies the 
conclusion for 1 < s < r' as well. Let fj be the collection of maximal dyadic intervals 
contained in Ij that do not contain any Ip, PeT. The intervals in the collection J form a 
partition of I T thus 



H£Ct/IIi/(r + ) = (Yj I \Yj ep ^f' Wp ^ Wp ^ ap ^S( x )\ dx )'' 

le T J J PeT 



JeJ ^1 PeT 



J_ 



JeJ PeT 



We set for PeT and ; > 1 

A(P,j) := I P n{x: N hl (x) $ & P , Nj(x) £ [co Pu )}. 



We gather some auxiliary calculations in the following lemma: 

6.2. Lemma. Fix a tree T and a top T of T and consider the partition of It into the intervals 
J & ST- Let } £ $ and denote by the dyadic parent of }. There exist bitiles Q(J) £ T and 
P(J) = /(i) x co(J) such that: 
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(i) Q(J) < P(J) < T. 

(ii) For every j G [l,K(x)] we have the pointwise inequality: 1/1a(p,/) ^ 1{ X -. Nj(x)eco(j)\- 
(Hi) We have the estimate 

f Wt E Hx)fdx < |/| density (Tr'. 

Proof. Fix some / G $ . Since / is maximal with the property that it doesn't contain any I P , 
PeT, there is some bitile Q(J) G T such that Iqq) c /W, where / (1) is the dyadic parent of /. 
Observe that we must have / (1) c I T . Define the frequency interval co(J) with |cu(/)| = 2/|/ (1) | 
and such that co T Q co{]) c co QU) . Thus the bitile P(J) := / (1) x cv(J) satisfies Q(/) < P(J) < T. 
This proves (i). 

Now for all bitiles PeT such that Ip D / ^ we have / c I P by the maximality of / which 
implies that |7p| > |/ (1) |. For every such P we thus have + cot £ cop n cu(/) and so cup c cu(/). 
We conclude 

|J ^p £ ^(/)- 

PeT: 7 P n/^0 

Let x be such that lj(x)l A ( P/ j)(x) ^ 0. Then I P n / and N ; (x) G cup M c o) P c cu(/). The 
previous inclusion thus implies ljl A ( Pi j) < l[ x: N ; (x)eo>(j))- 

Finally we have P(/) > Q(/) and Q(/) G T. Thus 

= f wt E w |r ' 



dx 

j: Nj(x)ea> m 

< l/l density (T)^ 

by the definition of density □ 



For t G {u = up, d = down} define the functions 
F J/T (x) := lj(x)g(x) e P {f,Wp d )wp d {x)ap{x) 



PeT T 
Ip37 



K(x) 



= E E ep ^' ^kw^w- 



PeT T ;=1 

fraj 
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Since every tree T can be written as a union of its up-part T u and its down-part T rf , we 
have the estimate 

\\ g c T f\y {R+) < (2> 7 X, (R+) f + (2>x*< J* • 

We estimate the two terms appearing in the previous sum separately. 
6.3. Lemma. We have 

(E'^'kaJ* * |I#density(T)size(T). 

Proo/. The function Fj^ can be written as 

oo 

Fjr d (x) = lj(x)g(x) ^ep(//Wp (; )wp d (x)fl / (x)l [i>) (K(x))l A(P/;) (x). 

PeT d ;=1 

Now consider two pairs (/, P) + (/', P') appearing in the previous sum, with /, f > 1 and 
P,P' e T d such that I P , 1 P , 2 /• We claim that A{P,j) n A(P',f) = 0. Indeed, since T A is a 
down-tree we have co Td Q 0Jp d n &>p' ^ and thus the bigger interval contains the smaller. 
For example we have cop^ C cup^ => cup, C &> p . 

If /' < j y < j - \ and x G A(P,/) then we have (x,N ; (x)) G P u and (x,N/_i(x)) £ P. 
Since Nj(x) > Ny_i(x) we get that necessarily N ; _i(x) < min co P . Thus 

N ; v(x) < Nj-i(x) < min cup < mma) P > => Nj>(x) £ a) P > u => x £ A(P',f). 

Suppose now that /' >/<=>/<;'- 1. If x G A(P', /') then (x, N ; v(x)) G P^ and (x, N ; v_i(x)) £ 
P. Since Ny(x) > N ; v_i(x) we must have N ; v_i(x) < min <u F . Thus 

Ny(x) < N ; v_i(x) < mincup = min co P > d < mina;^ => N ; (x) £ o>p u => x £ A(P,;). 

In every case we get that A(P,j) n A(P',f) + => / = /'. However all the up-parts P u , 
P G Td, are disjoint since is a down- tree, so we cannot have N/(x) G cyp a n wpj with P + P' . 
We conclude that A(P, /) n A(P', /') + => (P, ;') = (F, /') as claimed. 

The disjointness property of the A(P, j)'s, the definition of size and (ii) of Lemma 6.2 
imply that for any fixed x 

l(/>p d >l 

|F //d (x)| < |g(x)| sup sup — |fl / (x)|l J (x)l 4(P/ ; ) (x) 

PeT d l<j<K(x) \I P \2 

< size(T)\g(x)\ sup |^(x)|l {3/: N .( y)auU)} (x) 

l<j<K(x) 
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< size(T)|g(x)| ( £ \dj(x)fY. 
Integrating the previous estimate raised to the power r' yields 

H^jl&'PM * size ( T ) r ' f W*)!'' E W'** * l/l size ( T ) r ' density(T)'-', 
by Lemma 6.2, (iii). Since the collection $ partitions I T , summing in / G gives 

(£ll%ll^ (K+ / * size(T)density(T)( = |7# size(T) density(T), 

as we wanted. □ 

The proof for the up-part is more involved: 
6.4. Lemma. We have 

(L'^W/ * N^density(T)size(T). 

Proof. We fix some J & ST and xe/so that Fj M (x) ^ 0. We use Lemma 3.1 in order to write 
the function Fj rli in the form: 

F/,„(x) = lj(x)g(x)wj u (x) J^j E e i ,e ^</rWp i >/zj p (x)fl ; (x)l^ (P/; - ) (x), 

1 </'<£(*) PeT„ 
/: Nj(x)eco(J) Ip2j 

where epepp G {-1, +1} and is unimodular and depends only on the upper tile of the 
top T of T. Now for every 1 < j < K(x) with Ny(x) G co(J) , consider the inner sum 

^ e P e PT (f, Wp d )h Ip (x)aj(x)l A(P/j) (x). 

PeT„ 

The frequency intervals oj Pii , P G T !( , all contain the top interval co Tu and thus they are 
nested. Consider P, j such that l A (p^(x) ^ 0. We have N ; _i(x) £ &>p and Nj(x) G cup u . For 
all bitiles P' G T„ with | Jj=>, | < \I P \ we have that co Pu c a) P > u and hence, that Ny(x) G co P > u . 
Likewise, for all bitiles P' with \I P >\ > \I P \ we have co Pll □ co P > u and hence, that N ; _i(x) £ cup^. 
For fixed / G and for every x G / and 1 < j < K(x), there are thus some dyadic intervals 
/ c J sma11 c J larse c I s ™ 1 } and we have 

' X,] X,] X,J+1 

|^ e P ep T </ / a7p (i >%(x)fl;(x)l^ ( p /;) (x)| = | ^ epe PT </,a;p iJ >/ij p (x)fl / (x)|. 

PeT„ PeT„ 

Ip2j 7 small cl„c7 larse 

x,j v ' - x,j 
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< 0; 



(x)W^(x)(EpmaIl - E ; large)( V e P £ FT (f , Wp d )h Ip )(x)\ 
x,j x j V ( i f 

PeT„ 

fl 7 (x)(EpmaU - Elarge)^ V </, Wp d )Wp d )(x)\, 

X ') X,j v 7 



PeT„ 

since |w^J = 1. 

Denote by E^/ the martingale of dyadic averages of / with respect to dyadic intervals 
of length 2 l '. Summing in j G [1, K(x)] for which Nj(x) G co(J) and using Holder's inequality 
we get 

\F J/U (x)\ < lj(x)\g(x)\ Y \aj(x)(E pma n -E ^e)^ Y{f,w Pd )w Pd ){x)\ 

l<j<K(x) PeT u 
j: Nj(x)eco(J) 

j_ 1 

<l,(x)\g(x)\( Yj W)\ L |(E, ; -E^)(w;^(x)| r ) r / 

l<j<K(x) l<j<K{x) 
j: Nj(x)eco(J) j: Nj(x)ew(J) 

where £ v t 2 ,..., l' v £' 2 ,..., are integers with |/| < 2 £l < 2^ < 2 t2 < 2% < . . . < 2 e > <2 f ' < 
and / := Zp e T u e p(f> w P d ) w P d - Taking the supremum over all such choices of integers €j,£j' 
f or 1 < ; < K and all positive integers K and integrating over / gives the estimate 

f\Fj, u (x)\ r 'dx< f\g(x)f V \ aj (x)\ r )\E k (wlf)(xW v Ax. 
Here we denote 

\\b k \W: k ■.= sup sup (Vll/V-/Mf • 

2> - A k eo<t!<-<e K 

2 e '>A, j=0,...,K 

The function IIE^w^ f(-)ll^ is constant on / thus 

1 l k >\1\ 

< |/| density(T)'"' inf [m(|| E fc (w;~ /)(-)lk)f 

< density(T)^' JT[m(|| E fc (iD5?/)(-)lkr)(y)fdy / 
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where in the second inequality we also used Lemma 6.2, (iii). Now we sum over J e J~ to 
get 

(E 1 ^ 11 ^/ ^density(T)( f [M(|| E k (wlf\)\W,{y)f dyY 
< density(T)( J [|| E k (wlf)(y)\W r j' dyj 



I I T 

W - . . .. ». H 



< density (T) 1 1/1 | L / (R+) < density OOllE^ 07 ^ 07 ^!!^^) 



PeT„ 



< \I T \ 7 density (T) size(T), 

since r' < q' < 2 < q. In (*) we have the vector-valued Lepingle inequality from Proposition 
4.7 while in (**) we use the UMD property of X. □ 



7. The size and density lemmas 

Let X be a UMD Banach space with tile-type q > 2. In this section we recall the standard 
selection algorithms in terms of density and size. In terms of density we have: 

7.1. Lemma (Density lemma). Let P be a finite collection ofbitiles and 5 > 0. Define density(P) 
with respect to some function g : R+ — > X* with \g\ < 1 E , where E c R+ is a measurable set of 
finite measure, and a sequence {fly(%)}y with £ • \aj(x)f = 1. There exists a decomposition 

P — Psparse U Ty, 

7 

w foere each Ty is a free and 

density(P sparse ) < 2" 6 density(P), £ |I T .| < 2 5/ density (P)-'"|E|. 



Proof. We choose P spa rse to be as big as possible 

Psparse :=freP: sup f \g(x)f V |fl 7 (x)r'dx<2-'-' 6 density(P) 1 - 

For every P 6 P \ P spa rse there exists a bitile P' such that 



|J P |<2'' 6 density(P)-'" f \g(x)\ r ' £ \ aj (x)fdx. 

j: Nj(x)ea>p, 
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Among the chosen bitiles V let us call {7\}jt the bitiles that are maximal with respect to the 
partial order '<'. Now set 

:= {P G P sparse : P < T k }. 

It is clear that P \ P sp arse = U/TV We have 



Y H\ < r' b density^ YYj f WtH^thy- N jiy )ea, Tk] (x)dx 

k k j ^\ 

< l r ' b density(P)- r ' V Y f \aj(x)fdx. 

j k <Jk k nEr\\y.Nj(y)eQ)T k ) 

Now, for j fixed, the sets lj k n E n {y : NAy) G curj/ G Z, are all contained in E and are 
pairwise disjoint for different k's. Indeed if two of them intersected, say for k\ ± k 2 , then 
the corresponding bitiles T^, Tt 2 would also intersect, which contradicts their maximality 
Summing first in k and then in j we get 

Y UtJ < 2 r ' 6 density(P)" f ' Y f \ aj (x)\ r 'dx < 2 r ' 5 density (P)- r '|E|, 

k j J £ 

since Yjj \ a j(x)\ r ' = 1- n 

For the selection by size we prove a version of the standard size selection algorithm: 

7.2. Lemma (Size lemma). Let P be a finite collection of bitiles and X a Banach space of quartile- 
type q > 2. Define size(P) with respect to some function f G L I? (R+;X) and suppose that 
size(P) < a. There exists a disjoint decomposition 

P = Psmall U [J Ty, 

7 

where each Ty is a £ree and 

size(P small ) < T\o, Y N £ ff _,? ll/llJ« ; ( R+; xr 



7 

Furthermore, if\f\ < If for some measurable set F c 1R+ of finite measure, we have the following 
BMO-bound for the top time intervals: 



\y 



I / - A 7 r . 

7 



BMO 



where BMO stands for the dyadic BMO on the positive real line. 



The selection algorithm is contained in [12, Proposition 6.1] but we briefly outline it here 
for the reader's convenience. Furthermore we prove the last statement which is standard, 
but not yet recorded in the vector-valued setup. 
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Proof. For every tree T we set 



PeT„ 

where T is a top of T. We iterate the following selection algorithm. Consider all maximal 

trees inside P with A(T) > o2~^. Among them let Ti be one with top T\ whose frequency 
interval (Oj x has minimal center. Replace P by P \ Ti and iterate. When no trees can be 
selected any longer the remaining collection, P sma u, by definition satisfies size(P sma u) < 

T~<*o. Let {Jj}j be the selected trees. The top time intervals I Tj of the selected trees can be 

thus estimated by 



W(R+;X) 



The collection T := {Tj, u }j of the selected up-trees is a good collection by construction. Thus 
the tile-type property of X implies that the sum on the right hand side of the previous 
estimate can be estimated by ||/||y( R . x) - We get 



L n 



< o~ 



Li(R+;Xy 



It remains to prove the BMO-bound for the sums of top time intervals. Let / be a dyadic 
interval. We calculate 



dx 



j: I T .QJ PeT jiU 



\L1{R+;X) 



by another application of the tile-type of X. Now |/| < 1 F implies ll/ljlly(R + x) - 1^ n /I — Ul- 
Taking the supremum over dyadic intervals / completes the proof of the BMO estimate. □ 



We record some additional size estimates that will allow us to obtain some initial control 
on the size of the collections we will consider. 

7.3. Lemma. Let P be any collection of Utiles and define the density with respect to some function 
g : IR+ — > X* with \g\ < 1 E and the size with respect to some function f : R+ — > X with \f\ < 1 F . 
We have that density (P) < 1 and size(P) < 1. 
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The proof of the density estimate is completely trivial while the proof of the size estimate 
is in [12, Lemma 7.1] and relies on the UMD property of X. The following lemma is a 
BMO-type of estimate and is contained in [12, Lemma 7.2]. 

7.4. Lemma. Let C {I e T> : inf y6 /M/(x) < A} be a finite collection of dyadic intervals and K 
be a dyadic interval. Then 



IcK 

for all 1 < p < oo. 



<A\K\p, 

L p (R + ;X) 



8. Proof of Proposition 5.3 



We need to prove that for every pair of measurable sets E, F with |£|, \F\ < +oo there exists 
a major subset E' c E, |E'| > \\E\ such that 

\(C v f,g)\<\F\\\E\7 

for i < p - and for all |/| < If, |g| < 1e'. By dilation invariance we can assume that 
1 < |E| < 2. 

Case 1: \F\ < 1. Let 

G := {M(1 F ) > 4|F|}. 
Then |G| < | thus E' := E \ G can be taken as a major subset of E. We have 

(c F f,g) = 2^(/,w Prf )<wp d flp,g) = Yj + Yj ■ 

PeP PeP PeP 

l P £G IpQG 

The second sum vanishes since each Wp d is supported on I P C G while g is supported on 
E' c G c . Hence it suffices to consider any collection P' c {P e P : I P G} and prove the 
corresponding bound for Cp> in the place of Cp. For any such collection P' a standard 
argument using Lemmas 3.1 and 7.4 shows that we always have the bound 

size(P') < U < |f |. 

For the details of this argument see for example the proof of [12, Lemma 7.3]. Since P' is a 
finite collection of bitiles there exists some positive integer n such that 



density(P') < 2~ |E|, size(P') < 2~ \F\ 



no 1 
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Iterating the density lemma with 5 = 4 and the size lemma we write P' as a union of trees 

p ' = UU T »- 

neZ / 

such that 

density(T„, ; ) < 2*\E\, size(T, v ) < l1\F\K and £ |I Tj J < 2~ n . 

i 

Recalling the trivial estimate density(P) < 1 of Lemma 7.3 for any collection P we thus 
have 

density(T n , ; ) < nun(l,2?), size(T w , ; ) < min(|F|,2? |F|»). 
Applying the Lemma 6.1 with s = 1 we get 



l<Cp/,g>|= ( ^ep</,Wp rf )wp, ( (^K(%(^¥ 
<J PeP 

< ^ ^ J ^ £p(f / Wp i )wp d (x)a P {x)g(x)dx 

< ^ ^ l^r„J size(T„, ; ) density(T n , ; ) 



< £rrun(|F|,2i|F|?)mm(l,2^)£ \l Tn 

neZ ; 

< V T n min(|F|, 2? |F| * ) min(l, 2*). 



We estimate the previous sum as follows: 

\(C P f /g )\<\F\^ Yj 2" ( H) + |f| £ 2*H> + |fl £ 2-" 

A A n:l<2» 

re: 2"<|FK »: <2"<1 

<; IFI 1 "^ + |F| <; |F|*. 

In the last approximate inequality we used 1 — ^- = p--^>^ and the hypothesis |F| < 1. 

Case 2: |F| > 1. Let P be any finite collection of bitiles and set E' := E. By the trivial size 
and density estimates of Lemma 7.3 we have 

density (P) < 1 < 2$\E\, size(P) £ 1 < 2?|F|». 
We apply the density lemma and the size lemma recursively to decompose 

p =UU T -/ =: U r - 

n>0 j n>0 
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where each T n is the union of the trees, {T n ,j}j, and for all n e IN we have 
density^) < 2~? , size(T n ) < min(l,2~f |F|*), 



and 



II' 



■I T 



n,] 



n\-r\-\ 



BMO 



< 2"\F\ 



By interpolation of the last two bounds we get for any 1 < t < oo: 



For n fixed we estimate 



|< c r„/,g>| < ^ J lir n , y ( x )| X ep</,Wp d >ro P(j (x)g(x)flp(x)|& 



< 



1 J_ 
i ; d,-t 



A • B, 



v' 



i ~ i PeT„,j 

for t > r. We note that the function inside the norm in B is supported in £ and t' < r' . 
Thus, Holder's inequality and Lemma 6.1 imply 



B<\E\'' f\Y e P (f,Wp d )wp d ga P 

< sizetrjdensitytT;)^ Ihj) 

< min(l,2"f|F|?). 

Using (8.1) we estimate A as follows 



y> 1 



I/' 



A = 



E^J^( 2n|i:r ^)' =2?|i:| " H 



Gathering the estimates and we get for all t > r 

\(C T J f g)\<2^mm(h2-"m\F\-^. 
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Summing in n > thus gives, 

n>0 n: 2"<\F\ n: 2">\F\ 

if we choose t so that t > max(p, r), since \F\ > 1. We have also used the hypothesis r > q 
in penultimate estimate. 

This conclude the proof of Proposition 5.3 and thereby of Theorems 5.1 and 1.1. 



9. Interpolation 

In this section we discuss the proof of Theorem 1.2. This is a simple interpolation 
argument between the Hilbert space-valued case for the r-variation and the UMD valued 
case for the oo-variation, that is the main result from [12]. 

Proof of Theorem 1.2. Let X be a UMD Banach space of the form X = [Y,H] e for some 
< 6 < 1 and set q =: 2/6. From [12] we have that the Carleson operator maps L p ([0, 1); X) 
into itself for all 1 < p < oo. This can be rewritten in the form 

I|Sn/IIlp([0 / 1);^(X)) ^ ll/lb([0,l);X)/ 1 < p < OO. 

On the other hand we have for any Hilbert space H the following variation norm Carleson 
theorem: 

IIWH^([ai);^ s (H)) ^ II/IIlp([0,1);H), S > 2, s' < p < oo. 

This follows for example from Theorem 1.1 although in the special case of a Hilbert space 
one could just repeat the scalar proof. We first use the reiteration theorem as in [1, Theorem 
3.5.3], to write 

X = [Y,H]e = [[XHh,HU 

with 6 = (1 - co)5 + co. In our considerations one should think of 5 — > 0. Now fix 1 < p < oo 
and interpolate between I/([0, l);f°([Y,H] 6 )) and LP([0, l); r V s (X)). As in [22, p. 501], we 
have 

[r°([Y,H] 6 ),^ s (H)L c r»([[Y,HLH] ffl ) = ^'(X), 
where ^ = ^ + j <=> s a = s/co, and 6 = (1 - to)b + co. From this we get that 

IISN/llLf'ttaDrV'XX)) ^ ll/lb([0,l);X) 

whenever r > 2/6 = q and < - < - □ 
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9.1. Remark. Let us rephrase Theorem 1.2 for the integrability index p being fixed. 

First suppose that 2 < p < oo, so we are in the local L 2 -case. Then Theorem 1.2 says that 
the partial sums map L p ([0, 1); X) to L p ( < V r ([0 / 1); X)) whenever r > q since, in this case, the 
condition ^ < j, — is automatically satisfied. Here Theorem 1.1 gives exactly the same 
range. 

In the case p < 2 we get that the partial sums map U ([0, 1 ); X) to V ([0, 1 ); 'V'(X)) whenever 
2r/q > p' <=> r > |p'. This should be contrasted to the weaker statement of Theorem 1.1 
which in the case p < 2 gives the corresponding result only for the range r > (q-l)p' > |p'. 
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